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ABSTRACT 

Gastric cardia adenocarcinoma (GCA)is one of the few malignancies with unexplained reasons that have increased 
sharply in developed countries. The purpose of this review was to determine the pathways in GCA to identify new 
biomarker. So by comparing gene expression in GCA group with normal control identified important pathways. Gene 
expression data were extracted from the beforehand investigations then differentially expressed genes utilized in DAVID 
program to explorer and find related pathways. Our findings contain 367 gene names. Out of these 367 proteins, 199 
were found to be exclusively expressed in GCA; whereas 168 proteins were detected down-regulated or silenced. The 
GCA associated diseases based on the differently expressed genes made up of diseases pathway related colorectal 
cancer, small cell lung cancer, breast cancer and H. pylori infection stomach cancer. KEGG pathways related to GCA 
contained cell cycle, p53 signaling pathway, DNA replication, toll-like receptor signaling pathway and some other 
diseases. The GO-discovered categories also demonstrated most biological process and molecular function related to 
cancer. Up until now, there is no report to infroduce influential biomarkers in GCA so, the deregulated genes identifled 
in GCA patterns might be helpful for diagnosis, prognosis and therapies for gastric cancer but validation of these 
biomarkers is necessary. 
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Introduction 

Gastric cancer is the second most common 
cause of cancer death worldwide (1). Because 
the most patients are diagnosed at an advanced 
stage, where treatment options are limited, the 
5-year survival rate is below 20%. Indeed, 
precancerous lesions are often difficult to 
differentiate from gastric carcinomas in biopsy 
samples by conventional histopathologic 
analysis. In fact, experienced pathologists often 
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disagree in distinguishing invasive carcinoma 
from high grade dysplasia in gastroscopic 
biopsy specimens. Currently available 
serological tumor markers, such as carcino 
embryonic antigen (CEA) or carbohydrate 
antigen 19-9 (CA19-9) (2-5), for the early 
detection of gastric cancer have not enough 
sensitive and specific. So new early stage 
detection techniques, treatment options and 
knowledge about the molecular mechanisms of 
gastric cancer are needed. In this review, we 
focused on gene expression to improve the 
understanding, diagnosis, and follow-up of the 
progression of GCA. 
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Gastric cardia adenocarcinoma 

More than 90% of mahgnant tumors in the 
stomach are adenocarcinomas and less frequent 
tumors of the stomach include lymphomas, 
carcinoids and gastrointestinal stromal tumors 
respectively (1). Gastric adenocarcinomas are 
biologically and genetically very heterogeneous (6) 
that tumors arising in the stomach may have distinct 
etiologies in which gastric cardia adenocarcinoma 
(GCA) show biologic, epidemiologic, and clinic 
pathologic features more closely resemble 
esophageal adenocarcinomas (EACs) than gastric 
noncardia adenocardinoma (GNCA). Because, the 
cardia anatomically lies between the end of the 
esophagus and the body of the stomach, genetically 
reported also, the P53 mutation spectrum in GCA 
more closely resembled EAC than GNCA (7). A 
number of other genetic alterations have been 
reported in gastric cancer, including CDHl (8), P- 
catenin(6),TFFl (9), and Met (10). Further, 
canonical oncogenic pathways such as E2F, K-RAS, 
p53, and Wnt/p-catenin signaling are also known to 
be deregulated with varying frequencies in gastric 
cancer (11, 12). 

Epidemiology and risl< factors 

GCA is one of the few malignancies that has 
increased sharply in developed countries in recent 
years for reasons that are as yet unexplained (13,14) 
the wide variation in incidence across different 
geographical areas and higher proportion of GCA 
are two main characteristics of gastric cancer in Iran. 
It is the most common cancer in north and northwest 
Iran. A high prevalence of H.pylori infection, high 
dietary intake of salt and smoking are the main 
environmental factors of gastric cancer in Iran. 
Gasfroesophageal reflux disease, red meat and dairy 
products are other contributing factors in populations 
with a higher incidence of GCA (15). Increased age, 
male gender, a family history of upper 
gastrointestinal tract cancer, thermal damage from 
hot food (16) have all been consistent risk factors for 



GCA (16-24). A recent genome-wide association 
study of germline DNA found a common gene 
(PLCEl) associated with risk for both GCA and 
ESCC (25). Wang et al. identified genomic 
differences between gastric cancer by anatomic 
subtypes to development of appropriate targeted 
strategies for early detection, prognosis, and therapy 
(26). So, the aim of this study was to investigate 
GCA at the molecular level using pathway analysis, 
therefore, this makes relatively easy to find candidate 
marker proteins for GCA. Beside on study based 
systems provide convenient biomarkers of 
diagnosing cancer as well as drug targets. In addition 
to tumor therapy requires information about 
molecular alterations, thus we studied different gene 
expression of GCA in literature. 

Gene expression data from GCA 

During the last decade, there has been an 
exponential increase in the number of studies 
analyzing cancer tissue; in this study, data were 
extracted from Wang et al. investigation (26). All 
genes were significantly altered (up-regulate, down- 
regulate, silent and new expression gene) between 
gastric cardia adenocarcinoma tissues compared to 
normal tissues. For GCA, a total of 367 genes were 
differentially expressed between tumors and their 
matched normal samples. Of these genes, 199 genes 
were up-regulated and 168 were down-regulated. So 
according to these data, some analyses to find out 
early diagnosis or drug targets were performed. 

In order to carry out a refrospective meta- 
analysis of the functional annotations using UniProt 
accession numbers (http://www.uniprot.org), a 
pubHcly available web-based tool, to search for 
annotations that are significantly associated to the list 
of GCA related proteins. 

We also used DAVID Bioinformatics Resources 
6.7 (the Database for Annotation, Visualization, 
and Integrated Discovery) (27), 
"http://david.abcc.ncifcrfgov/", a comprehensive 
set of fiinctional annotation tools for understanding 
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the biological meaning behind large hsts of genes, 
to obtain gene ontology and KEGG pathway 
information for differential genes between the GCA 
and control pattem. Differentially expressed genes 
that were similar and different between the GCA 
and control were compared. The DAVID gene 
classification tool functionally analyzes a large 
number of genes in a high-throughput fashion by 
classifying them into gene groups based on their 
annotation term co-occurrence. This classification 
is accomphshed with p-value(EASE Score, an 
alternative name of fisher exact statistics in DAVID 
system, referring to one-tail Fisher exact 
probability value used for gene-enrichment 
analysis) for each enriched annotation terms. 

Cancer cells do not invent new pathways; they 
use pre-existing pathways in different ways or they 
combine components of these pathways in a new 
fashion. Gene clustering based on fimctions illustrate 
correlated expression patterns (28,29) and analyzing 
the gene-expression data might reveal the 
organizational pattem of gene expression in cancer, 
which might, in turn, help us to identify new 
potential drug targets; so, in this study have been 
utilized classification software to mining gastric 
cancer data. The diseases related to expressed genes 
using GENETICASSOCIATIONDBDISEASE 
was detected the most diseases related to the 
expressed genes in GCA pattem versus control made 
up of diseases pathway related cancer (BIRC5, 
BUB IB, APOE, IL8, MMP7, PCNA, TFF2 and...), 
colorectal cancer (CD14, ADHIB, AURKA, CTSB, 
FAM46A, GSTAl, GSTA4, APOE, IGFBP3, IL8, 
MMP7, MMP9, MDK, PLA2G2A, PLAU, PCNA, 
RNASEl, S0D2, TYMS and TFRC), H. pylori 
infection stomach cancer (TlMPl, MMP7 and 
MMP9), Small cell lung cancer and breast cancer 
(MAD2L1, TTK, UGT2B15, ADHIB, AURKA, 
BUBIB, COL18A1, GSTAl, GHR, APOE, 
IGFBP3, IL8, LEPR, MMP9, P0N2, SST, 
S0D2,TYMS, TFRC). Some of the differentially 
expressed genes reported in GCA were also 
dysregulated in a similar pattem as colorectal 



cancer, Small cell lung cancer and breast cancer 
suggests that despite their differences in cell 
types, these cancers likely share common genetic 
and/or environmental factors in their etiology. 
Such evidence for a common genetic influence is 
evident by results from a genome-wide 
association study which found a shared 
susceptibility locus in PLCEl, CDC25B and 
C0L1A2 for both GCA and ESCC (26,30). 
While progression of H.pylori-induced 
superficial gastritis to chronic atrophic gastritis, 
intestinal metaplasia and dysplasia is the main 
pathologic event in majority of GNCA, GCA 
show less prominent and even reveres 
relationship with H.pylori infection and 
subsequent atrophic gastritis. In contrast, recent 
study in Ardabil supported that H.pylori infection 
could be a putative risk factor for GCA. Because 
the incidence of GCA in the Ardabil region is 
higher than European countries, U. S. A. Japan 
and Korea (31) and more than 98% of people 40 
years old and above have been infected with 
H.pylori and almost all of them had H.pylori 
associated chronic gasfritis involving the antrum, 
corpus, and cardia that consequently up to 35% 
have concomitant corpus atrophic gastritis (32). 

Pathway Analysis Using DAVID 

The KEGG pathway analyses of the 
significantly expressed genes using DAVID are 
shown in tables 1 and 2. The GCA pattem versus 
confrol analysis revealed that different pathways 
were related to the following up-regulated 
expressed genes (Table I) that contained ECM- 
receptor interaction, cell cycle, focal adhesion, 
p53 signaling pathway (CCNBl, CCNB2, 
CCNEl, IGFBP3, RRM2, SERPINB5, SFN and 
THBSI), small cell lung cancer (E2F3, COL4AI, 
COL4A2, CCNEI, FNI and LAMBI), DNA 
replication, toll-like receptor signaling pathway, 
bladder cancer (E2F3, IL8, MMP9 and THBSI), 
oocj^e meiosis, leukocj^e frans endothelial 
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migration, systemic lupus erythematosus, 
pathways in cancer (E2F3, BIRC5, C0L4A1, 
COL4A2, CCNEl, FNl, IL8, LAMBl, LEFl and 
MMP9), complement and coagulation cascades. 
Over the past few decades, many studies have 
identified and characterized pathways within 
gastric cancer that the number of them associated 
up-regulated genes include the cell cycle, cellular 
growth and proliferation, cell cycle checkpoint, 
extracellular matrix remodeling, and 
angiogenesis (eg, Wnt signaling and cell cycle 
checkpoint pathways, such as SULFl, SFRP4, 
LEFl, T0P2A, and CDC2 (33-37)), and the 
integrin signaling pathway (ARPCIB, COLlAl, 
C0L4A1, FNl, and LAMBl). Some genes are 
also related to adaptive immune responses 
(eg, CD14) and tumor metastasis (eg, CD9). The 
most KEGG pathway according to table 2, were 
found in down-regulated genes contribute to 
pathways in metabolism of xenobiotics by 
cytochrome P450, drug metabolism, arginine and 
proline metabolism, butanoate metabolism, fatty 
acid metabolism, retinol metabolism, nitrogen 
metabolism, histidine metabolism, prion diseases, 
tryptophan metabolism, valine, leucine and 
isoleucine degradation, tyrosine metabolism and 
steroid hormone biosynthesis. 

The common down-regulated genes 
previously found in GCA are consistent with 
other studies on gastric cancer, such 
asAKRlBlO, ALDH3A1, ATP4B, CA2, 
IGFBP2, KLF4, MUC5AC, MUC6, TFFl, and 
TFF2 (35,38,39). Important pathways in down- 
regulated genes were mainly involved in 
metabolic pathways, digestive system 
development, or mucosal integrity. Several genes 
are thought to have specific fiinctions in gastric 
epithelium, such as PGC and GIF, implying that 
dedifferentiation is a common feature of 
carcinogenesis (26, 33). BUBlBis also a 
spindle-assembly checkpoint gene is a 
susceptibility gene for this gastric tumor (26). 



Table 1. The KEGG pathways related to the up- 
regulated expressed genes in GCA pattern 

KEGG Pathway Number % P-Value 
of genes 



Table 2. The KEGG pathways related to the down- 

regulated expressed genes in GCA pattern 

KEGG Pathway Number of % P- Value 



genes 



Metabolism of 


8 


0.6 


<0.0001 


xenobiotics by 








cytochrome P450 








Drug metabolism 


8 


0.6 


<0.0001 


Arginine and proline 


5 


0.3 


0.003 


metabolism 








Butanoate metabolism 


4 


0.3 


0.007 


Fatty acid metabolism 


4 


0.3 


0.011 


Retinol metabolism 


4 


0.3 


0.024 


Nitrogen metabolism 


3 


0.2 


0.029 


Histidine metabolism 


3 


0.2 


0.044 


Prion diseases 


3 


0.2 


0.062 


Tryptophan metabolism 


3 


0.2 


0.078 


Valine, leucine and 


3 


0.2 


0.092 


isoleucine degradation 








Tyrosine metabolism 


3 


0.2 


0.092 


Steroid hormone 


3 


0.2 


0.099 



biosynthesis 



Table 3. GO-discovered categories for the genes that 

up-regulated in the GCA pattern 

GO GO name No. of P-value 



ECM-receptor 


15 


8.2 


<0.0001 


interaction 








Cell cycle 


14 


7.7 


<0.0001 


Focal adhesion 


16 


8.8 


<0.0001 


p53 signaling 


8 


4.4 


<0.0001 


pathway 








Small cell lung 


6 


3.3 


0.011 


cancer 








DNA replication 


4 


2.2 


0.02 


Toll-like receptor 


6 


3.3 


0.023 


signaling pathway 








Bladder cancer 


4 


2.2 


0.03 


Oocyte meiosis 


6 


3.3 


0.03 


Leukocyte 


6 


3.3 


0.04 


transendothelial 








migration 








Systemic lupus 


5 


2.7 


0.075 


erythematosus 








Pathways in cancer 


10 


5.5 


0.08 


Complement and 


4 


2.2 


0.099 


coagulation 








cascades 
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constituent of down regulated expressed genes are 
a part of extracellular region, spindle, nuclear part 
and membrane. The genes that were uniquely 
down-regulated expressed between the GCA 
pattern and the control were as shown in Table 4, 
predominantly grouped into biological process, 
including oxoacid metabolic process, lipid 
metaboHc process, response to stimulus, response 
to reactive oxygen species, response to nutrient 
levels, response to drug. Functional classes 
contain mitochondrial part, oxidoreductase 
activity, cofactor binding, lipid binding, ion 
binding, DNA binding and signal transducer 
activity. Cellular constituent of down regulated 
expressed genes are a part of endocytosis, 
extracellular region, plasma membrane part, 
insoluble fraction, membrane, cytoplasm. All 
these up-regulated expressed genes are 
functionally related, or members of the same 
pathway or protein complex are able us to identify 
genes that share similar expression patterns across 
a variety of experimental conditions. 

For genes dysregulated significantly in GCA, 
we note a couple of interesting examples here. 
E2F3 (E2F transcription factor 3) showed up- 
regulated in GCA patients. E2F3, Transcription 
factor E2F/dimerisation partner (TDP), Winged 
helix repressor DNA-binding, play an essential 
role in most cancer pathway such as cell 
cycle, pancreatic cancer, glioma, prostate 
cancer, melanoma, bladder cancer, chronic 

myeloid leukemia, small cell lung cancer, non- 
small cell lung cancer (40,41). Other genes up- 
regulated significantly in GCA only, but not in 
GNCA is MADILI (MADl mitotic arrest 
deficient-like 1) play as cell cycle checkpoint 
present in different diseases like lymphoma, 
somatic, prostate cancer (42). 

Table 4. GO-discovered categories for the genes that 
down-regulated in the GCA pattern 



categories genes 

Biological process 



Cellular 



Molecular 



cell cycle 


31 


<0.0001 


regulation of cell cycle 


14 


<0.0001 


response to stimulus 


63 


<0.0001 


developmental process 


68 


<0.0001 


epidermis development 


9 


0.001 


collagen metabolic process 


7 


<0.0001 


intracellular signaling 


20 


0.12 


cascade 






cell cycle checkpoint 


8 


<0.0001 


extracellular structure 


13 


<0.0001 


organization 






response to nutrient 


7 


0.005 


response to vitamin 


5 


0.007 


apoptosis 


15 


0.009 


regulation of biological 


96 


0.016 


process 






response to DNA damage 


10 


0.027 


stimulus 






instituent 






extracellular region 


71 


<0.0001 


spindle 


14 


<0.0001 


nuclear part 


28 


0.059 


membrane 


72 


0.9 


function 






glycosaminoglycan 


14 


<0.0001 


binding 






enzyme regulator activity 


23 


<0.0001 


cytokine activity 


8 


0.005 


G-protein-coupled receptor 


4 


0.12 


binding 






transcription regulator 


11 


0.97 


activity 






kinase activity 


7 


0.89 



Gene ontology analysis using DAVID 

The GO-discovered categories using DAVID 
analysis for up-regulated expressed genes (table 3) 
between the GCA pattern and the control pattern 
comparisons were predominantly grouped into 
functional classes of glycosaminoglycan binding, 
enzyme regulator activity, cytokine activity, G- 
protein-coupled receptor binding, transcription 
regulator activity, kinase activity. The most 
important biological process are cell cycle, 
regulation of cell cycle, response to stimulus, 
developmental process, epidermis development, 
collagen metabolic process, intracellular signaling 
cascade and cell cycle checkpoint. Cellular 



GO GO name Number P_Value 
categories of genes 
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Biological process 






oxoacid 


19 


<0.0001 


metabolic process 






lipid metabolic 


17 


0.002 


process 






response to 


44 


0.009 


stimulus 






response to 


6 


<0.0001 


reactive oxygen 






species 






response to 


7 


0.008 


nutrient levels 






response to drug 


6 


0.042 


response to 


8 


0.003 


extracellular 






stimulus 






homeostatic 


15 


0.006 


process 






metabolic process 


81 


0.012 


biological 


62 


0.83 


regulation 






regulation of 


9 


0.41 


apoptosis 






endocytosis 


4 


0.3 


Cellular constituent 






extracellular 


42 


<0.0001 


region 






plasma 


28 


0.017 


membrane part 






insoluble fraction 


13 


0.04 


membrane 


66 


0.16 


cytoplasm 


75 


0.006 


mitochondrial 


6 


0.53 


part 






Molecular function 






oxidoreductase 


20 


<0.0001 


activity 






CO factor binding 


10 


<0.0001 


lipid binding 


12 


0.002 


ion binding 


47 


0.071 


DNA binding 


10 


1 


signal transducer 


13 


0.99 


activity 







RAP 1 GAP (RAPl GTPase activating protein) is 
down-regulated in OCA reveal six loci influencing 
plasma levels of liver enzymes in population-based 
genome-wide association studies (43). 

S0X9 (SRY (sex determining region Y)-box 9), 
transcription factor S0X-9,was up-regulated in GCA 
play cell morphogenesis in different levels and have 
critical roles in diseases mutation like campomelic 
dysplasia with autosomal sex reversal (44). 



ADA (adenosine deaminase) is down-regulated 
in GCA response to reactive oxygen species and 
response to hypoxia cause adenosine deaminase 
deficiency, partial, severe combined 

immunodeficiency due to ADA deficiency (45). 

AKR1C2 (aldo-ketoreductase family 1) is down- 
regulated in GCA has function in fatty acid 
metabolic process that cause obesity, hyperphagia, 
and developmental delay (46). 

Conclusion 

The most relationship between genes in GCA and 
other cancer cells was extracted by comparison 
disease pathways. Gene clustering based on GO 
biological process, fimctional classes and cellular 
components illustrate correlated functional 
expression patterns related cancer condition, which 
might, in turn, help us to identify new potential drug 
targets. On top of that, morphologically identical 
tumors can be distinct in their mutational patterns, 
signaling-pathway alterations and gene-expression 
profiles, and, most importantly, in their response to a 
range of therapies. Therefore, new predictive 
molecular diagnostics need to be developed and 
integrated with drug development and clinical-trial 
design. So far, there have been a few possible 
biomarkers for GCA. In this study introduced some 
biomarkers that might use for diagnosis, prognosis 
and treatment prediction. To unravel the possible 
role(s) of these proteins in GCA tumorigenesis, 
flirther investigations are needed. 
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